Generalization of the Dynamic Ordering for the One-Sided Block Jacobi SVD Algorithm: II. Implementation

نویسندگان

  • Martin Bečka
  • Marián Vajteršic
چکیده

We have designed, implemented and tested (by simulation on a serial computer) the new dynamic ordering for the parallel one-sided block-Jacobi SVD algorithm. Our idea is based on the estimation of the cosines of principal angles between two block columns X and Y of the same width without explicitly forming the matrix product XY (or Y X) and computing its SVD. Instead, we propose to use a fixed number 2q of iterations in the Lanczos algorithm applied to the symmetric 2x2 block Jordan-Wielandt matrix with zero diagonal blocks, 21-block XY and 12-block Y X; the order of the Jordan-Wielandt matrix is the sum of the block column widths. However, the matrix blocks XY and Y X are never formed explicitly; the needed matrix-vector multiplications are computed exchanging intermediate product vectors between two processors that host the block column X and Y . After computing 2q iterations, the Frobenius norm of an auxiliary tridiagonal matrix of order 2q estimates the square root of twice the sum of squares of q largest cosines (representing q smallest principal angles) between X and Y . In the parallel algorithm using p processors, these weights can be used for choosing p pairs of block columns, which are far from orthogonality with respect to those q smallest angles. We show how to implement this new parallel ordering in the distributed paradigm of parallel computing using the Message Passing Interface (MPI). First numerical results obtained by simulation show that the one-sided parallel dynamic ordering can lead to a substantial decrease of the number of parallel iteration steps needed for the convergence as compared to a cyclic ordering.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Ordering for the Parallel One-sided Block-jacobi Svd Algorithm

The serial Jacobi algorithm (either one-sided or two-sided) for the computation of a singular value decomposition (SVD) of a general matrix has excellent numerical properties and parallelization potential, but it is considered to be the slowest method for computing the SVD. Even its parallelization with some parallel cyclic (static) ordering of subproblems does not lead to much improvement when...

متن کامل

Generalization of the Dynamic Ordering for the One-Sided Block Jacobi SVD Algorithm: I. Analysis and Design

The efficiency of the one-sided parallel block-Jacobi algorithm for computation of the singular value decomposition (SVD) of a general matrix A ∈ Rm×n, m ≥ n, depends–besides some numerical tricks that speed-up the convergence–crucially on the parallel ordering of subproblems, which are to be solved in each parallel iteration step. We discuss in detail possible generalizations of the so-called ...

متن کامل

Parallel One-Sided Block Jacobi SVD Algorithm: II. Implementation

This technical report is devoted to the description of implementation details of the accelerated parallel one-sided block Jacobi SVD algorithm, whose analysis and design was described in [21]. We provide discuss a suitable data layout for a parallel implementation of the algorithm on a parallel computer with distributed memory. This discussion is complicated by the fact that different computati...

متن کامل

Parallel One - Sided Block - Jacobi Svd Algorithm

A new dynamic ordering is presented for the parallel one-sided block Jacobi SVD algorithm. Similarly to the two-sided variant, which has been analyzed and implemented in last 10 years, the dynamic ordering takes into account the actual status of a matrix—this time of its block columns with respect to their mutual orthogonality. Using p processors, in each parallel iteration step the p mostly in...

متن کامل

Parallel Svd Computation in Updating Problems of Latent Semantic Indexing ∗

In latent semantic indexing, the addition of documents (or the addition of terms) to some already processed text collection leads to the updating of the best rank-k approximation of the term-document matrix. The computationally most intensive task in this updating is the computation of the singular value decomposition (SVD) of certain square matrix, which is upper or lower triangular, and conta...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008